-
Notifications
You must be signed in to change notification settings - Fork 373
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature(cy): add BDQ algorithm #558
Conversation
extend n-step TD; polished;
Codecov Report
@@ Coverage Diff @@
## main #558 +/- ##
==========================================
- Coverage 84.59% 84.44% -0.15%
==========================================
Files 555 556 +1
Lines 45197 45405 +208
==========================================
+ Hits 38233 38344 +111
- Misses 6964 7061 +97
Flags with carried forward coverage won't be shown. Click here to find out more.
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
Description
Add the bdq algorithm and config for hopper and halfcheetah env.
Related Issue
TODO
n-step for BDQ
gradient scaling for backbone network
unify bdq_nstep_td_error and q_nstep_td_error function
unify action_map function
unit Tests writing
Check List